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I.  Introduction 


Consider  the  0-1  integer  program 

n 

(p)  max 


E c.x. 
j=l  J J 


s.t 


£ ax.  or  b.  i ^ M = f 

j-i  lj  J - 1 


x^=0  orl  j ( N = (1.,  .n] 


We  will  denote  by  U the  universe  of  problems  of  the  type  p.  For 
p 6 U let 

n < 

F (p)  = fx  € f 0,1]  | £ a x & b i € M] 

j = l J J " 

Z (p)  = max  cx 
x <E  F (p) 


0 (p)  = (x  g F (p)  | cx  = Z(p) 

be  the  feasible  set,  optimal  value,  and  optimal  set  for  p respectively. 

Let  now  PC  U be  a particular  class  of  problems  (say  knapsack,  travelling 

salesman,  set  covering,  etc.)  and  let  A be  an  exact  algorithm  for  P,  i.e.  such 

that  given  any  problem  p(  P,  A finds  an  optimal  solution,  x £ 0(p).  The  work 

of  Cook  [ 1 ],  Karp  [ 6],  and  others  indicates  that  for  many  interesting  classes 

of  problems  P (including  the  ones  just  mentioned),  one  is  unlikely  to  find  an 

exact  algorithm  A which  will  be  able  to  solve  every  problem  p f P without, 

on  occasion,  investing  a computational  effort  which  is  essentially  equivalent 

N 

to  enumerating  a significant  fraction  of  the  vertices  of  {0,1]  . It  is  also 
well  known  that  practitioners  of  Integer  and  Combinational  Programming  have 
often  been  confronted  with  individual  and  classes  of  problems  which  defy 
solution  in  "reasonable"  amounts  of  time.  Thus,  it  is  not  surprising  that 


a considerable  amount  of  effort  has  been  recently  devoted  to  the  analysis  of 
approximate  (heuristic)  algorithms,  e.g.,  [3],  [5],  [7],  In  contrast  to  an 
exact  one,  an  approximate  algorithm  finds  a feasible  solution,  x £ F(p),  but 
not  necessarily  an  optimal  one.  The  effectiveness  of  such  an  algorithm  with 
respect  to  the  problem  p must  then  be  evaluated  along  two  attributes:  the 
computational  effort  involved  in  finding  x and  the  "quality"  of  x as  an  ap- 
proximation for  the  optimal  solution.  It  is  this  second  attribute  which  is 
our  domain  of  interest  in  this  paper. 

It  will  be  more  convenient  to  measure  the  "quality"  of  a given  solution  on 
a negative  scale,  i.e.,  we  will  be  concerned  with  measuring  the  "distance",  so 
to  speak,  of  x from  an  optimal  solution.  We  will  call  this  "distance"  the 
Error  associated  with  x and  any  function  which  attempts  to  quantify  it,  the 
error  evaluation  of  x relative  to  p,  denoted  Ep(x,p).  The  main  purpose  of  this 
paper  is  then  to  investigate  and  characterize  functions  which  can  serve  as  error 
evaluators.  (It  should  be  noted  that  this  discussion  has  nothing  to  contribute 
towards  the  resolution,  and  is  independent  of,  the  deeper  question  associated 
with  aggregating  the  characteristics  of  a given  algorithm  with  regard  to  the 
individual  problems  p ( P,  into  a measure  of  effectiveness  relevant  to  the 
class  P as  a whole.  It  is  only  after  a proper  measure  is  established  for  the 
individual  problems  that  one  needs  decide  whether  averaging  or  worst  case 
analysis  is  the  meaningful,  tractable,  practical,  etc.  method  of  aggregation.) 

The  organization  of  the  paper  is  as  follows.  In  section  II  we  present  a 
set  of  properties  to  be  expected  from  an  error  evaluating  function  and  examine 
them  vis-a-vis  the  traditional  "percentage  error"  measure.  In  section  III  we 
show  that  functions  which  satisfy  the  conditions  introduced  in  section  II  do  in- 
deed exist.  We  then  discuss  the  possible  uniqueness  of  such  functions.  Finally, 
in  section  IV,  we  conclude  with  several  examples  of  error  evaluating  functions 
for  particular  classes  of  problems. 


II.  Basic  Properties  of  Error  Evaluating  Functions 

A given  error  evaluation  function  (u...F.)  is  not  expected  to  perform 
properly  unless  it  satisfies  a set  of  reasonable  properties.  In  this  section, 
we  attempt  to  propose  such  a set.  To  simplify  notation,  we  will  delete  the 
parameter  P from  our  notation  for  Ep(x,p). 

The  first  two  properties,  A and  B,  require  hardly  a comment.  An  E.E.F. 
should  certainly  be  a monotone  (at  least  weakly)  function  of  the  solution 
value  cx.  Also,  the  error  associated  with  an  optimal  solution  can  be  conven- 
iently set  to  zero.  Thus: 


Condition  A.  Let  p £ P,  x ,x  £ F(p) 

12  1 2 
cx  > cx  =>>  E (x  ,p)  < E(x  ,p). 


Condition  B.  Let  p £ P 


x £ 0(P)  =>  E(x,p)  = 0 


An  E.F.F.  often  used  by  researchers  is  given  by 
E^x.p)  = (cxP  - cx)  / |cxP| 
where  xP  £ 0(p)  is  an  optimal  solution  for  p. 

If  P is  such  that  for  every  p(  P,  cxP  / 0,  then  E^x.p)  is  easily  seen  to 
satisfy  conditions  A and  B.  So,  of  course,  do  the  functions 
E2(x,p)  = cxP  " cx 

E3(x,p)  = (cxP  - cx)/  | cj 

where  c^  is  the  first  (index  wise)  non  zero  entry  in  the  objective  row. 

E (x  d)  = f°  if  X € °(P) 

V >P)  [1  otherwise 


Let  us  examine  E2(x,p)  more  carefully.  One  finds  it  hardly  surprising 


7 


• s . 


that  this  function  is  rarely  proposed  as  an  E.E.F.  Intuitively,  one  feels 
that  the  difference  in  the  objective  values  is  meaningless  by  itself  and  can 
be  gauged  only  relative  to  some  other  normalization  factor.  Put  differently, 
E2(x,p)  is  unacceptable  since  its  value  can  be  arbitrarily  manipulated  by  changes 
of  units  in  the  objective  function.  Likewise,  E^(x,p)  can  be  rejected  on  the 
grounds  that  its  value  depends  on  the  particular  order  in  which  the  variables 
are  indexed.  Several  authors  have  observed  that  similar  objections  can  be  often 
raised  with  respect  to  E^.  For  instance,  Cornuejols,  Fisher  and  Nemhauser  [ 2 ] 
examined  a set  of  problems  of  the  plant  location  variety.  Such  problems  are 
known  to  be  invariant  under  addition  of  certain  row  constants  to  the  cost  matrix. 
Yet,  as  noted  in  [ 2 ] , the  value  of  the  error  for  a given  non-optimal  solution, 
as  measured  by  E^(x,p),  does  change  under  such  transformations.  To  overcome 
this  difficulty,  an  E.E.F.  is  proposed  in  [ 2 ] which  is  invariant  under  the 
particular  cost  transformations  in  question.  An  identical  problem  arises,  of 
course,  when  one  applies  E^(x,p)  as  an  E.E.F.  for  transportation  problems, 
general  (as  opposed  to  Euclidean)  travelling  salesman  problems  and  the  like 

A related  objection  to  E^(x,p)  is  brought  forward  by  Korte  in  [ 7 ] . 

It  is  well  known  that  a given  optimization  problem  can  often  be  presented 
as  a maximization  or  minimization  problem,  the  two  versions  being  virtually 
equivalent.  Yet  the  equivalent  does  not  always  carry  over  to  the  corresponding 
approximation  problems.  For  instance,  the  value  assigned  by  E^  to  a given 
non-optimal  solution  for  the  knapsack  problem  depends  on  the  version  chosen. 

Even  worse,  the  computational  complexity  of  the  two  versions  may  be  different. 

This  is  the  case  for  the  travelling  salesman  problem  where  the  maximization  version 

can  be  solved  by  a guaranteed  performance  polynomial  algorithm  [7],  yet 

the  problem  of  solving  the  minimization  version  to  any  accuracy  is  NP  hard  [8], 

As  Korte  suggests,  this  behavior  casts  some  doubts  as  to  the  appropriateness 
of  measuring  errors  by  E^(x,p)  for  this  class  of  problems. 

Condition  C below  is  designed  to  take  account  of  the  ideas  brought  forward 
in  the  foregoing  discussion.  Basically,  it  enforces  equality  of  the  values 


assigned  by  an  E.E.F.  to  corresponding  solutions  in  problems  which  are  obvious- 
ly 'equivalent',  but  may  appear  different  when  viewed  superficially.  By  equivalent 
we  mean  here  problems  which  are  related  to  each  other  by  permutations  of  the 
rows  or  columns,  complementing  of  variables,  and  trivial  algebric  manipulations  of 
the  objective  function  and  constraints.  But  first  we  must  digress  for  some 
notation  and  definitions. 

For  a problem  p f P,  let  us  denote 
c =(c^) 

A =(a^with  a\  (a^)  the  jth  column  (ith  row)  of  A. 
b =(b.) 

p = <p  .)  is  'k"  or  "="  as  the  case  may  be. 

We  then  have  the  more  compact  representation: 

(p)  max  cx 

s.t.  Axpb 
x € {0,1}  N 

On  occasion,  we  will  refer  to  p as  (c,  A,  p , b) . 

For  definitions  1-4  below,  let  p = (c.  A,  p,  b)  £ U,  x £ F (p) . 

Definition  1 Let  cp  be  a column  permutation  operator.  Bycpp,  we  mean  the 
problem  p'  = (cpc,  cpA,  p,  b) . x'  = cp  x is  defined  accordingly. 

Definition  2 Let  tt  be  a row  permutation  operator.  By  TTp  we  mean  the  problem 
p'  = (c,  n A,  np,  rrb).  By  nx  we  simply  mean  x. 

S 

Definition  3 Let  S C N.  We  will  denote  by  D the  operator  which  complements 

the  variables  x,,j  P S,  i.e.: 

J 


1 


D (p)  = p'  = (c' , A' , p , b') 


• r s j e s 

Cj  "jcj  j i S 


i:  =h j 

J [a.  j 


b.  = b - £ a. 


1 1 j<ES  ^ 


_S  , 

D x = x is  given  by 


, f 1-*  j € s 

V'l  HS 


Unless  specifically  required,  we  will  suppress  the  parameter  S and  refer  to  the 


operator  as  D. 


Defini  tion  h Let  M = { i g H p.  is  "=" } be  the  index  set  for  the  equality  rows 


of  p.  Let  also 


a ....  a € R+  be  any  set  of  M+l  positive  constants 


o M Mp 

3 , ...  p € R be  any  set  of  M-l-1  real,  M dimensional  vectors  such 

P 


that  the  matrix  B ^ + a *6  . . if  M , j f M , with  6 . . 

J ij  P P ij 


non  singular.  Then 


_ fO  i * j 
U |l  i = j 


a^p  = p’  = (c’’  A’’  p>  b,) 


Where 


i o o 

c=ac+£0  a 

j J i€M  1 ij 

P 


i k k 

a k,  =<X  a.  . + £ 3.  a.. 

kj  kj  i£M  1 ^ 

P 


b't  *1  *i\ 
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Obviously, 


G „ x = x 

a,£ 


Again  we  will  usually  suppress  the  parameter  cl  ,f3  and  refer  to  the  operator  as  G. 

12  1 

Definition  5 Let  p € U,  and  let  p £ U be  a problem  obtainable  from  p by  a 

sequence  of  tt  , cp  , G and  D operators  applied  in  any  order.  We  will  then  say 

1 2 1 2 11  2 
that  p is  reducible  to  p , denoted  p -*  p . Also  let  x £ F(p  ),  and  let  x 

be  obtained  from  x^  by  an  identical  sequence  of  operators.  Then  we  say  that 

2 12  12 
x corresponds  to  x , x -*  x , relative  to  p -«  p . One  can  easily  verify  that  in 

2 2 

this  case,  x £ F(p  ). 

We  are  now  ready  to  define  the  third  property  of  E.E.F's: 

Condition  C.  Let  p\p  £ P such  that  p*  -*  p . Further,  let  x^  £ F(p^)  and  let 

2 12 

x correspond  to  x^  relative  to  p -»  p . 


11.  2 2 
E(x  ,p  ) = E(x  ,p  ). 


8 


III.  Further  Properties  of  Error  Evaluating  Functions 

Condition  C cuts  significantly  the  number  of  available  E.E.F.'s.  For 
instance,  E^(x,p)  is  excluded  for  several  frequently  occurring  classes  of 
problems  P.  Naturally,  for  a given  P,  the  questions  arrise  as  to  the  existance 
and  uniqueness  of  an  E.E.F.  which  satisfies  conditions  A-C.  In  this  section,  we 
address  these  questions.  The  first  is  settled  in  the  affirmative  (Theorem  1).  The 

second  in  a qualified  affirmative  (Theorems  2 and  3).  We  open  the  discussion 
with  some  technical  remarks  concerning  the  relation  -*  . 

Remark  1 It  can  be  easily  established  that  any  of  the  row  type  operators, 

Ti  and  G,  commutes  with  the  column  type  operators,  ep  and  D.  On  the  other  hand, 

TT  and  G do  not  commute,  nor  do  cp  and  D.  However,  the  following  weaker  form  of 
commutativity  does  hold: 

(i)  For  every  column  permutation  operator  cp  and  every  subset  S c N,  there 

exists  s'c  N such  that 

s s 1 

(1)  cp  D = D cp 


The  property  goes  the  other  way  too,  i.e.,  for  every cp  and  every  S c N,  there 
exists  SC  N such  that  (1)  holds. 

(ii)  For  every  row  permutation  operator  rr  and  every  set  of  constants  a * 3 as 
defined  in  condition  C,  there  exists  a set  of  a',  8'  such  that 


(2) 


TIG  „ = G I i TT 

a *8  a 8 


Again,  the  statement  is  true  in  the  reverse  direction  too,  i.e.,  for  every  set 
a *8 ' there  exists  a seta*  3 such  that  (2)  holds. 

It  can  also  be  easily  verified  that  any  sequence  of  n type  operators  is  in 
itself  a tt  type  operator,  cp  , G and  D operators  behave  likewise.  Also,  each 
of  the  four  types  of  operators  contains  an  identity  operator.  Thus,  definition  5 


can  be  stated  equivelently : 


- 9 - 

® 1 2 12 

Definition  5 : Let  p , p £ U.  Then  p -•  p iff  there  exists  tt  , G,  cp , D such  that 

2 1 
P = n 9 G D p 

12  12 

In  such  a case,  we  say  that  x -•  x relative  to  p -*  p if: 

2 „ 1 
x =cp  D x 


Proposition  1 

The  relation  -•  among  problems  is 

12  21 

(l)  Symmetric,  i.e.,  p -•  p =?>  p -•  p 

1 2 2 3 1 3 

(ii)  Transitive,  i.e.,  p -*  p , p -♦  p =s>  p -«  p 

(iii)  Reflexive,  i.e.,  p -*  p 

A corresponding  set  of  statements  can  be  made  with  respect  to  the  relation  among 

12  3 

the  corresponding  solutions  x ,x  ,x  . 

Proof : 

(i)  follows  from  the  fact  that  each  operator  has  an  inverse, 

(iii)  follows  from  the  fact  that  each  operator  type  has  an  identity,  and 

(ii)  follows  by  the  same  reasoning  used  in  the  paragraph  preceding  definition  5' 

Q.E.D. 

Proposition  2 

Let  Pc  U be  arbitrary.  Then  P has  a unique  partition: 

P = U P such  that 
X € K * 


P^  0 P^  =0  for  X + X 
1 2 

Let  P 6 P„ > P € P ' 

X X 


P <*>  X =X 


Proof 


immediate  from  proposition  1. 


C.E.D. 


We  will  refer  to  the  Px's  as  the  equivalence  classes  within  P.  We  will  call  two 
individual  problems  equivalent  if  they  belong  to  the  same  equivalence  class. 


10  - 


Remark  2 


Let  p*  £ P,  i = 1,2  such  that 


p . The  transformations  involved  in  this 
relation  are  not  necessarily  unique.  It  follows  then  that  there  may  be  more 


2 2 12  12 
than  one  distinct  x £ F (p  ) such  that  x -»  x relative  to  p -*  p . For  example,  let 


(P1) 


max  x,  + x„ 
1 2 


s.t. 


x^  + 2x2  < 2 


2x^  + x2  < 2 


xlSx2  = o or  1 


and  let  x^  £ F(p*)  be  (1,0).  Consider 


<P2) 


max  2x, 

+ 2x 

1 

2 

s.t. 

X, 

+ 2x„  < 2 

1 

2 - 

2 x- 

+ x„  < 2 

1 

2 - 

xi 

,x2  = 0 or 

. 2 

: that  p 

S 1 


S - 0;  a -2 ; a - 1»  i - 1>2;  tt  and  9 are  the  identity  permutations.  (Since  in 


this  case  Mp-0,  0 -0  too.)  The  solution  which  corresponds  to  x^  is  then  x^=cpDx^=  (1,0] 

1 2 

But  another  set  of  possible  operators  which  transforms  p to  p is  given  by 
2 


p =TT  9 G aD~p 


< » „ 1 
G . r 
a 

where  G and  D are  as  before,  but  where  9 permutes  the  columns  of  p^  and  tt 


permutes  the  rows.  Under  these  operators,  the  solution  which  corresponds  to 

1 12  2*1  2 
x relative  to  p p is  y =cp'x  = (0,1)  £ x . However,  there  is  a considerable 

similarity  between  the  feasible  sets  of  equivalent  problems  as  the  following  three 

propositions  indicate. 

Proposition  3 

For  i = 1,  2 let 


i . i i i i 
P = (c  ,A  ,p  ,b  ) £ P 


/ 


-//■ 


1 2 

be  such  that  p _ p . In  particular,  let 

2 „ ^ 1 
p = TT  cp  G D p 

1 2 

For  an  arbitrary  x £ F (p  ) let  y £ F(p  ) be  given  by 


Then, 


y = <p  D x 


2 1 

c y = k^c  x + k2  where  k^  > o,  k2  depend  on  the  transformations 


TT , cp  , G and  D,  but  not  on  the  particular  solution  x. 

Proof  Let  us  examine  the  effect  of  each  of  the  transformations  in  question  on 
the  value  of  the  objective  function  of  the  corresponding  solutions. 

• S i s 

(i)  Let  p = (c,  A, p , b),  p'  = D p,  x £ F(p),  x = D x as  in  definition  3. 

c'x'  = Z c'.x.  = Z c.x.  + Z (-c.)  (1-x.) 
j€N  3 J jg  ^SJ  3 j<=  S 3 

= Z c.x  - Z c.=  k. c + k_ 

j€N  J J ies  J 11  2 

where  k.  = 1 and  k„  = -Z  c . do  not  depend  on  x. 

1 2 jes  J 

(ii)  Let  p = (c,A,p  , b),  p'  = G p,  xf  F(p),  x'  = x as  in  definition  4. 

a jS 

c'x'  = Z c'.  x'  = Z (x°c-  + £ p.°a..)x. 
j€N  J j j€N  J i£Mp  1 1J  J 

= a°  £ c x.  + Z ? ° Z a.  .x  = 
j£N  J 3 i£Mp  j€N  1J  J 


a°cx  + Z 0°b.  = k cx  + k2 

i £ M 11 
P 

where  k =a°  >o,  and  k = Z p°b.  do  not  depend  on  x. 

1 i01  1 1 

P 

(iii)  It  can  be  easily  verified  that  n and  cp  do  not  have  any  effect  on  the 


objective  value. 


Q.E.D. 


Proposition  4 

Let  p1  £P,i  = 1,  2 such  that  p1  -♦  p2.  Let  Z*  > Z*  ...  > be  the  distinct 

values  of  [cix,|xg  F(p*)}  , with  the  corresponding  solution  sets  N*,  1<  j<  r(i), 

and  multiplicities  n^  = ] n|  ] . Then 


/ 


12 


* 


(i)  r(l)  = r (2)  = r 

/..si  2 . . . 

(li)  ».  = n . for  j = 1,  ...r 
J J 

2 1 1 2 
Proof  Letp  =n«pGDp  be  a particular  transformation  of  p to  p • Let 

11  12  12  1 

x ,y  £ F (p  ),  x =ip  D x , y = cp  D y . It  follows  directly  from  proposition 

1111  2222  1 2 
3 that  c x > c y <=?>  c x >cy.  Thus,  x £ <=>>  cp  D x = y £ . Since 

1 2 

(p  D is  a one  to  one  transformation  of  F(p  ) onto  F(p  ),  we  have  that 
1 2 

n*  = n^.  To  complete  the  proof,  we  have  to  repeat  the  argument  with  respect 


to  N j , j = 2,...r(l). 
Proposition  5 


Q.E.D. 


1 2 1 2 

Let  p ,p  £ U be  such  that  p -*  p and  assume  that  there  is  more  than  one  set 

of  transformations  which  induce  this  relation  . In  particular, 

2 _ 1 
p = TT  cp  G D p 


but  also 


2 i i i i 1 

p = TT  cp  G D p 


l r 2 12.11I 

x € F(p  );  X =tpDx;y  =ip  D x . 


2 2 2 2 

c y = c x 

Proof  Using  the  notation  of  proposition  4,  let  x*  £ N * for  some  1 < j < r. 

Then  by  the  same  logic  applied  in  the  proof  of  that  proposition,  we  have  that 

2 2 2 2 
x £ N.  for  the  same  index  j.  But  then  also  y £ N . by  the  same  argument.  Thus, 

2 2 2 2 2 

c. x = c y = Zj  Q.E.D. 

We  are  now  ready  to  address  the  existance  and  uniqueness  issues  for  those 
E.E.F.'s  which  satisfy  conditions  A-C  for  a given  class  P.  We  will  call  such 


E.E.F.  s 


for  P.  It  should  be  noted  that  this  concept  is  defined  relative 


to  a given  class  P since  condition  C is  in  effect  among  problems  within  P only. 

Thus,  it  is  entirely  possible  to  have  an  E.E.F.  which  is  proper  for  one  class  of 
problems  P,  but  improper  for  another.  Indeed,  on  occasion,  one  may  wish  to 
override  the  equivalence  between  problems  implied  by  the  relation-*  . For  instance, 
one  may  feel  that  a standard  transportation  problem,  whose  constraints  have  been 
manipulated  by  G and  D type  transformations,  may  not  be  easily  recognized  as 
a transportation  problem  and  therefore  need  not  necessarily  be  evaluated 
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1 

identically.  The  same  distinction  can  be  made  between  general  and  Euclidean 
cost  traveling  salesman  problems,  etc.  Such  cases  can  be  conveniently  handled 
by  a proper  choice  of  the  class  P.  (One  would  choose  P to  be  the  class  of 
standard  format  transportation  problems  in  the  first  case. 

Euclidean  traveling  salesman  problems  in  the  second).  Naturally,  the  larger 
is  P,  the  fewer  are  the  functions  which  are  proper  for  it.  One  may  then  be 
concerned  that  if  P is  large  enough,  a proper  E.E.F.  may  not  exist.  This 
possibility  is  ruled  out  by 

Theorem  1 For  every  class  P c U,  there  exists  a proper  E.E.F. 

Proof  Let  p(  P and  let  z^>  Z2***  > zr  be  the  distinct  values  of  the  objective 
function  with  the  corresponding  solution  sets  N^,  N^,  . ..,  N^_.  Consider  the  E.E.F. 
E*(x,p)  = j-1  if  x £ Nj,  i.e.  if  cx  = 

It  can  be  easily  seen  that  E*  (x,p)  satisfies  conditions  A and  B.  The  fact 
that  it  satisfies  condition  C as  well  follows  from  propositions  4 and  5.  Q.E.D. 

Can  we  expect  uniqueness  of  E*  as  a proper  E.E.F.  for  P?  The  following 
Theorem  shows  that  every  proper  E.E.F.  yields  in  a natural  way  a vast  number  of 
new  proper  E.E.F. 's. 

Theorem  2 Let  E(x,p)  be  a proper  E.E.F.  for  P.  Let  P , ...P  be  the  partition 

1 x 

of  P to  equivalence  classes  established  in  proposition  2.  For  each  1<  k<  X, 

let  b (•)  be  any  non-decreasing  function  of  one  argument  such  that  b (0)  = 0. 
k k 

Then 

E'(x,p)  = bk(E(x,p))  for  p(  Pk 
is  also  a proper  E.E.F. 

Proof  We  have  to  show  that  E'(x,p)  satisfies  conditions  A,  B and  C.  That  it 
satisfies  A follows  from  the  facts  that  E(x,p)  satisfies  A and  that  ^(0  is 


“/¥- 

monotone.  E'(x,p)  satisfies  B since  E(x,p)  does  and  since  ^(0)  = 0.  It  re- 

1 2 

mains  to  be  seen  that  E'(x,p)  satisfies  condition  C.  Let  then  p , p £ P,  such 
12  12 

that  p p . It  follows  that  p ,p  £ for  some  1 < k < X . Hence,  the 

same  function,  bk(.),  is  used  to  define  E'(x,p)  for  both  p^  and  p2 . Let  x*  -•  x2 
1 2 

relative  to  p -»  p . Since  E(x,p)  is  proper 

11  2 2 
E(x  ,p  ) = E(x  ,p  ) 

but  then 

E'Cx1,?1)  = b^ECxVp1))  = bk(E(x2,p2)  = e’(x2,P2)  Q.E.D. 

In  light  of  Theorem  2,  the  uniqueness  of  proper  E.E.F.  is  ruled  out.  It 
is  interesting  to  note,  however,  that  every  proper  E.E.F.  is  accounted  for  by 
applying  a monotone  transformations  of  the  type  referred  to  in  Theorem  2 to  E*(x,p), 
the  E.E.F.  referred  to  in  the  proof  of  Theorem  1.  Thus,  as  the  following  theorem 
indicates,  we  have  uniqueness  of  E*(x,p)  up  to  monotone  transformations: 


Theorem  3 Let  Pc  U and  let  E(x,p)  be  a proper  E.E.F.  for  P.  Then  E(x,p) 
can  be  represented  as 

E(x,p)  = bk(E"(x,p)) 

where  E*(x,p)  is  the  E.E.F.  referred  to  in  the  proof  of  Theorem  1 and  bk(-)  are 
the  monotone  functions  referred  to  in  Theorem  2. 


Proof  The  proof  is  by  constructing  the  functions  b.  (•).  1<  k<  X.  For  every 

T K 

1 < k < A , let  us  choose  a representative  problem  p £ arbitrarily.  Let 

lc  lc  lc  lc 

N,  ,N  ...N  ...  be  the  partition  of  the  feasible  set  of  p according  to  the 

objective  value  as  in  the  proof  of  Theorem  1.  For  each  1 <_  j < r(k),  choose  x"1 

j k 

arbitrarily  such  that  xJ  £ . By  definition 

E*(xj,pk)  = j - 1 

Let  E(xj,pk)  = akj 
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Since  E(x,p)  is  proper,  is  well  defined,  in  the  sense  that  its  value  depends 

on  the  class  ?k  and  on  the  index  j,  but  not  on  the  particular  representative 

k Ik  L- 

problem  p £ and  solution  x £ Nj  chosen.  Since  E*(x,p  ) ranges  over  the 

integers  0 - r(k)  - 1 only,  it  suffices  to  define  on  those  values.  Define 


bk(j-D  =ajk  for  j = 1,  r (k) 

It  then  follows  by  construction  that 

E(x,pk)  = bk(E*(x,pk)). 

Since  pk  £ was  chosen  arbitrarily,  we  have  that 
E(x,p)  = bk(E*(x,p))  for  p £ Pk« 

It  remains  to  be  seen  that  bk(*)  does  have  the  properties  asserted.  Both  follow 
the  fact  that  E(x,p)  is  proper.  Thus,  t>k(0,  k = 1...1  is  monotone  since 


condition  A implies  that  the 
for  x £ 0(pk),  E (x,pk)  = 0. 


akj 


are  monotone  with  j . 


Also,  condition  B implies  that 


Thus,  ax  k = 0 and  hence  bk(0)  = 0,  k = 1...X  Q.E.D. 
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IV.  Linear  E.E.F.'s:  Examples 

The  construction  of  Theorem  3 accounts  for  all  the  proper  E.E.F.  's  for 
a given  class  P.  However,  this  construction  is  intractable  computationally, 
since  it  calls  for  determination  of  the  complete  solution  structure  (i.e., 
the  sequence  of  the  z^'s),  and  for  the  identification  of  the  appropriate 
monotone  function,  b (•),  applicable  to  the  problem  in  question.  In  this 
section,  we  examine  some  1 inear  proper  E.E.F.'s  which  can  be  calculated  on 
the  basis  of  considerably  less  information.  One  type  of  such  functions 
uses  the  concept  of  a reference  point,  brought  forward  by  Cornuejols,  Fisher 
and  Nemhauser  [ 2 ] . Because  of  the  vast  richness  of  the  family  of  proper 
linear  E.E.F.'s,  the  section  is  intended  to  be  a sketchy,  rather  than  syste- 
matic, discussion  of  some  examples. 

By  a linear  E.E.F. , we  mean  a function  of  the  form 

E(x,p)  =3  - a cPx 

P P 

where  cP  is  the  cost  vector  for  p,  P and  a > 0 are  constants  which  depend 

p p ~ 

on  p,  but  not  on  x.  We  will  restrict  our  attention  to  the  non-trivial  case 
CLp  > 0.  From  condition  B,  we  get  an  alternative  expression  for  E 


denoting 


we  have 


E(x,p)  = ap(z(p)  - cpx) 

Y„  ' I/O, 

E(x,p)  = (z(p)  - cPx)/y  . 


We  refer  to  Yp  as  the  normalization  factor.  In  order  that  E(x,p)  be  proper, 
the  Yp's  should  be  chosen  in  such  a way  that  condition  C is  enforced.  This 
can  usually  be  done  in  several  ways  as  the  following  proposition  implies. 


Proposition  7.  Let  a.  > 0,  w. , i = l...k  be  arbitrary  such  that  E w.  = 1. 

l l i_l  1 

Let  also  E (x,p)  = (z(p)  - cpx)/yp,  i=l...k  be  proper  linear  E.E.F.'s  for  P. 


r~ 
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Then,  so  are  ^ k . 

E(x,p)  = (z(p)  - cpx)/  L ay1 

i=l  1 P 

E(x,p)  = (z(p)  - cpx)/  n a. (y1)  1 

i=l  1 p 

A p t 

and  E (x,p)  = (z(p)  - cK*x)/  max  a.Y 

i=l,k  1 p 

Proof . It  is  obvious  that  multiplying  a proper  E.E.F.  by  1/a  for  a > 0, 
leaves  the  E.E.F.  proper  (Theorem  2).  Therefore,  it  is  enough  to  consider 
the  case  = 1,  i=l,...,k.  Consider  E first.  It  is  enough  to  examine  the 
case  k=2  since  the  general  case  follows  by  induction.  Let  then 

E(x,p)  = (z (p)  - cp-x)/(Yp  + Y^) 

Let  p,  q € P,  s.t.  p ~ q and  let  x 6 F (p)  , x — y relative  to  p -*  q.  We  have 
to  show  that 

E(x,p)  = E (y , q) 

The  statement  is  trivial  if  x € 0(p).  Assume,  then,  that  z(p)  - cP-x  > 0. 
Since  E*  are  proper  for  i = 1,2  we  know  that 

(z(p)  - cP-x)/Yp  = (z(q)  - cq-x)/Yq  i=l,2 

since  the  numerator  of  the  two  expressions  is  non-zero 

yV(z(p)  - cP*x)  = Y^  / (a(q)  - cq-x) 

P q 

Adding  the  two  equations  for  1-1,2,  and  flipping  the  numerator  and  denominator 
back,  we  get  the  required  result  for  E. 

Let  us  now  examine  E with  p,  q,  x and  y as  before.  Since  E*’  are  proper: 

(z(p)  - cp-x)/Yp  = (z(q)  - cq*x)/y*  i=l,...,k 

(z(p)  - cP-x)=  (z(q)  - cq-x)  • (Y1/?1) 

P q 

Raising  each  side  to  the  power  w^,  and  multiplying  the  terms  together,  wc  get: 

i; 


or  since 
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k n wi  k V?i  k , , 

n (z(p)  - cP.x)  = n (z (q)  - cq-x)  . n (y1  | y1) 

_ 1 n 1 1 n 


i = l 


L v = 1 
i=l 


i=l 


(z(p)  - cP • x)  = (z (q)  - cq-x)/  II  (Y„/y„) 


i = l 


i/.  i.wi 


1=1 


p q 


Rearranging  terms,  we  get  the  desired  result. 


Finally,  consider  E.  We  have  to  show  that 

(z(p)  - c P • x) / max  y^  = (z(q)  - cq>y)/  max 


i-1, . . . ,k 


i 1 , . . . ,k 


Let  y^  = max  y*.  It  is  enough  to  show  that  y^  = max  . y*.  Assume, 


P i=l,...,k  P 


q i=l,...,k  q 


l j 

on  the  contrary,  that  y^  > y^  for  some  1 < l < k.  It  then  follows  that 

(z(p)  - cp.x)/y^<  (z(p)  - cP-x)/Yp  = (z(q)  - cq-y)/y^<  (z(q)  - cC 


■y)/y. 


But  then,  since  is  proper. 


(z(p)  - cP-x)/y^  = (z (q)  -cq-y)/y^,  a contradiction. 


Q.E.D. 


Proposition  7 suggests  a strong  relationship  between  normalization  factors  for 
Proper  E.E.F.'s  and  homogeneous  functions  of  degree  1.  This  relationship  is 
further  established  in 

Proposition  8.  Let  PCD.  For  p 6 P,  let  cP  the  cost  vector  of  p.  Let 

1?  be  the  row  space  spanned  by  the  equality  rows,  i £ M , of  A 

P P p 

c be  the  orthogranal  projection  of  c on  L 

L 


Then 

E(x,p)  = ( z (p ) - cP-x)/f(dP) 
is  a proper  E.E.F.  for  p whenever  f(dp)  satisfies: 
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(i)  for  every  i 6 {l,...,N} 


f <dl  * * *di  * * *dn  > = f <di  » • • • "di  • • *dn  ' 


(li)  for  every  i,  j € {l,...,N} 


f(d1P...diP...djP...dnP)  = f(d1P...djP...d.P---dnP) 


(iii)  f(d)  is  homogeneous  of  degree  one,  i.e., 

f(ad1P,ad2P,...,adnP)  = af  (d^, . . . ,dnP) 


Sketch  of  Proof  and  Discussion.  Consider  first  the  case  M = 0,  i.e.,  there 

p 

P P 

are  no  equality  constraints  in  p.  In  this  case,  L and  c are  both  equal  to 

i-i 

(0....0)  and  dP  = cP.  If  M ^ 0,  then  dP  is  an  invariant  under  G type  transforma- 
0 

tions  with  a = 1,  i.e.,  under  additions  of  multiples  of  the  equality  constraints 

0 , 

to  the  objective  row.  The  case  of  a r 1,  (i.e.,  rescaling  of  the  objective  row) 
is  taken  care  of  by  requirement  (iii)  which  assures  that  such  scaling  will 
affect  the  numerator  and  denominator  proportionally.  Condition  (i)  is  designed 
to  accomodate  complementing  variables  (i.e.,  D type  operators).  It  can  be  (fully 
or  partially)  relaxed  if  P itself  excludes  such  operators.  In  other  words,  if 
P does  not  contain  any  two  problems  which  are  obtainable  from  each  other  by  com- 
plementing a given  column,  then  (i)  can  be  relaxed  with  respect  to  this  column. 
For  instance,  one  can  argue  that  for,  say,  travelling  salesman  problems,  the  oper- 
ation of  complementing  variables  is  rather  unnatural  and  therefore  problems 
with  complemented  variables  should  be  excluded  from  P.  In  such  cases,  there  is 
no  need  to  insist  on  the  symetry  imposed  by  (i).  (Allowing  for  complementing 
variables,  however,  comes  in  very  handy  in  the  context  of  enumeration  algorithms). 

Condition  (ii)  ensures  the  invariance  of  E under  permutation  of  columns 
({p  type  operators).  Again,  it  can  be  often  relaxed  in  response  to  the  structure 
of  P.  For  example,  if  P is  the  set  of,  say,  plant  location  problems,  there  is 


—ZJ. 
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a natural  distinction  between  the  variables  which  represent  the  on-off  status  of 
a plant  and  those  which  correspond  to  the  distribution  pattern.  Clearly,  there 
is  no  reason  to  impose  condition  (ii)  between  these  two  sets  of  variables. 
(Strictly  speaking,  in  this  case,  f is  not  a function  of  d alone  since  specific 
entries  in  d are  identified  by  their  respective  columns  in  A.  Nevertheless,  we 
will  refer  to  the  normalization  factor  as  f(d)). 


be  the  sum  of  k largest  elements  (in  absolute  value)  in  the  (modified)  objec- 
tive  row  d.  f (d)  satisfies  (i),  (ii)  and  (iii)  and  is  then  appropriate  for 


a normalization  factor.  For  instance,  if  P is  the  class  of  (binary)  knapsack 
problems,  then  f^(d),  which  in  this  case  reduces  to  |cmaxl  = max|c^|,  is  a proper 
normalization  factor.  Note  that  |c  I represents,  in  a way,  a measure  for  what 
the  algorithm  is  all  about  since  by  rounding  the  optimal  solution  for  the  con- 
tinuous relaxation  of  the  problem  we  get  a solution  which  is  at  most  I cmaxl units 
away  from  the  optimum.  Thus,  E(x,p)  = (z (p) -cPx) /I c | is,  in  my  opinion,  a 
meaningful  measure  for  the  quality  of  a solution  to  the  knapsack  problem.  I am  not 


aware  of  any  approximate  algorithm  for  this  problem  which  can  guarantee  a per- 
formance better  than  the  obvious  level  of  1 with  respect  to  this  measure. 

(2)  If  the  equalities  defining  the  individual  problems  p € P are  of  the 


transportation  type,  then  d can  be  obtained  from  c by  simply  adding  row  and 
column  constants  in  such  a way  that  the  row  and  column  sums  of  d are  zeroes. 
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(This  can  be  accomplished  by  a one  pass  scan  of  the  rows  and  columns  of  c).  If 
we  exclude  from  P problems  in  which  some  of  the  variables  are  complemented,  we 
can  relax  requirement  (i)  of  Proposition  8.  Let  d be  arranged  such  that 


d > d 
ni  ” n2 


> d 
— rr 


and  consider  the  range  of  (modified)  costs 


fR(d)  = d - d 
v ’ rr,  rr 
1 n 

D 

f (d)  measure  the  possible  variability  of  the  per  unit  cost  over  the  different 
routes.  Multiplied  by  the  total  amount  shipped,  it  yields  an  upper  bound  on 
the  difference  in  the  objective  function  between  any  two  feasible  solutions  to 
p,  and  may  be  therefore  meaningful  as  a normalization  factor.  This  measure  can 

be  refined  in  several  ways  by  considering  the  ranges  of  the  individual  rows 
and  columns  and  aggregating  them  as  in  Proposition  7. 


(3)  We  finally  consider  a class  of  normalization  factors  obtained  by 
using  a reference  point,  as  suggested  in  [21.  For  every  p€P,  let  xp€[0,l]  , 
the  Unit  Cube,  be  a reference  point  (not  necessarily  feasible,  not  necessarily 
integer).  Let  the  operators  cp  and  D be  extended  in  a natural  way  to  the  Unit 
Cube  so  it  will  make  sense  to  refer  to  the  correspondence  x -*  y relative  to 
p q,  for  x € [0, 1 ]N  even  though  x £ F (p)  . (In  this  case,  of  course , y £ F (q)  . ) 


Proposition  9.  Let  P c U.  For  each  p ( P, 
(i)  z(p)  t cPxP 
. (ii)  a*?*1’  = b , i (H  i.e. 

ii  p 

constraints  of  p 

P q 

p -*  q =>  x -*  x 


P N 

let  x £ [0,1]  be  such  that: 


p 

, x satisfies  the  equality 


(iii) 


relative  to  p •*  q. 
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Then 

E(x,p)  = (z(p)  - cPx)/|(z(p)  - cPxP)| 
is  a proper  E.K.F.  for  P. 


Proof.  The  proof  resembles  closely  that  of  Proposition  8 (which  was  sketched 
only).  The  only  comment  in  order  is  the  fact  that  it  is  condition  (ii), 
rather  than  feasibility  or  integrality  of  x,  which  enforces  the  invariance 
of  E under  G type  transformations.  Q.E.D. 


There  are  a number  of  ways  to  choose  x to  satisfy  (i),  (ii)  and 
(iii).  Clearly,  one  can  use  the  solution  of  a given  reference  heuristic 

algorithm. for  P which  is  invariant  under  the  transformations  in  question. 

p 

For  instance,  for  the  knapsack  problem,  x can  be  chosen  as  the  solution 
of  the  greedy  (in  the  sense  of  the  cost  to  weight  ratios)  algorithm,  pro- 
vided ties  are  broken  properly.  (E.g.,  breaking  ties  according  to  the 
size  of  the  cost  coefficients  is  o.k. , however,  breaking  them  according  to 
the  index  of  the  variables  in  question  is  not.) 

Alternatively,  a generalization  of  the  origin  can  be  used.  For  instance, 

if  M = 0,  one  can  set 
P 


(1) 


> 0 


< 0 


Similarly, 


if  is  composed  of  a set 


of  disjoint  multiple  choice  constraints: 


L x = k. 

J€s,  J 1 • 


then  x can  be  obtained  by  picking  the  worst  (cost-wise)  k^  variables  from  each 


set  s 
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It  is  interesting  to  note  that  the  traditional  percentage  error  measure 
can  be  viewed  as  using  the  origin  for  a reference  point.  Since  the  origin 
does  not  necessarily  satisfy  the  conditions  of  Proposition  9,  this  measure 
may  turn  out  to  be  improper.  However,  it  can  often  be  modified  in  such  a 
way  so  as  to  make  it  proper  while  still  retaining  some  of  its  basic  proper- 
ties.  This  is  what  is  actually  happening  when  one  uses  x as  defined  in 
( l)  for  the  case  = 0.  Clearly,  if  for  some  p £ P,  > 0 for  all  j £ N, 
then  this  measure  coincides  with  the  percentage  error  measure  for  this 
problem.  However,  the  modified  measure  is  different  from  the  original  one 


if 


some  of  the  c 


j 


are  negative. 
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